Entanglement of assistance 
and multipartite state distillation 
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We find that the asymptotic entanglement of assistance of a general bipartite mixed state is equal 
to the smaller of its two local entropies. Our protocol gives rise to the asymptotically optimal EPR 
pair distillation procedure for a given tripartite pure state, and we show that it actually yields EPR 
and GHZ states; in fact, under a restricted class of protocols, which we call "one-way broadcasting", 
the GHZ-rate is shown to be optimal. 

This result implies a capacity theorem for quantum channels where the environment helps trans- 
mission by broadcasting the outcome of an optimally chosen measurement. We discuss generalisa- 
tions to m parties, and show (for m = 4) that the maximal amount of entanglement that can be 
localised between two parties is given by the smallest entropy of a group of parties of which the one 
party is a member, but not the other. This gives an explicit expression for the asymptotic localisable 
entanglement, and shows that any nontrivial ground state of a spin system can be used as a perfect 
quantum repeater if many copies are available in parallel. 

Finally, we provide evidence that any unital channel is asymptotically equivalent to a mixture of 
unitaries, and any general channel to a mixture of partial isometries. 

Keywords: entanglement of assistance, quantum error correction, feedback control, unital channels, localis- 
able entanglement, entanglement length 



I. MULTIPARTITE QUANTUM STATES 

One of the great ongoing programmes of quantum 
information theory is the classification of multipartite 
(pure) quantum states ip AB --- z ; and the understanding of 
the possible transformations between them allowing only 
local operations and classical communication (LOCC). 
As entanglement presents a resource that can be used 
for e.g. quantum communication, it is especially inter- 
esting to study the asymptotic Shannon-theoretic limit. 
In this scenario, a few parties hold asymptotically many 
copies of identical states distributed among them, only 
joint operations between the particles at the same site 
and classical communication between the parties are al- 
lowed, and the conversion of states occurs with vanishing 
errors in the asymptotic limit. 

In the bipartite case, this question is well understood: 
every pure state ip AB — \ip)(ip\ AB is asymptotically re- 
versibly equivalent to maximally entangled (EPR) states, 



can we quantify the exact yield of the useful EPR states, 
but the latter serve as a normal form in general. 

For multipartite states, the situation becomes more 
complex: there does not exist any longer a single state 
suited as a "gold standard" , e.g. it is quite evident that 
an EPR state \$>2) AB can never be equivalent to any 
quantity of Greenberger-Horne-Zeilinger (GHZ) states of 
three parties, 
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at rate E(ip) = S(tp A ) = S(A), the entropy of entangle- 
ment [j (Note our notation convention: tp A — Tr^i/)' 45 
is the restriction of the state tp AB to A.) So, not only 
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So one has to aim at a "(minimal) reversible entangle- 
ment generating set" (MREGS) Q, about which little 
is known, except that apart from the e asy candidates of 
|$2> AS , |*2) BC , |$ 2 ) AC and \T) ABC II, an MREGS 
has to contain at least another state, and possibly in- 
finitely many. See for an instructive case study. 

Usually the two parts of the multi-party entanglement 
programme — classification and possible transformations 
— are viewed as one, but as we have seen, the first is re- 
ally much harder: this is because it involves studying the 
transformations between pairs of states which are asymp- 
totically reversible. 

In this paper, we have a more modest goal: we want 
to go from (many copies of) a given state to particular, 
interesting states, like the EPR and GHZ state. To be 
precise, one would like to "distill" as many as possible 
of these target states, with high fidelity in the limit of 
n — > oo, and will care primarily for optimality of these 
processes and not so much for reversibility. 



2 



II. THE TASK(S) 

Given many copies of a (pure) tripartite state ip ABC , 
which "standard" entangled states like EPR states $2 be- 
tween any pair of parties, or GHZ states T can the three 
parties distill by local operations and classical communi- 
cation (LOCC)? 

We shall focus on three scenarios in succession: first, 
we study optimal distillation of EPR states between a 
given pair of players from a tripartite state; second we 
recast the protocol as one of distilling EPR and GHZ 
states at the same time, and show that for this target 
set, and a restricted class of protocols it gives optimal 
yield; and third, we look at m-partite states and how 
many EPR states between a prescribed pair of players 
can be distilled by local measurements on the other m — 2 
parties. 

Scenario 1 was studied in 0, 0, |23| under the name 
of "entanglement of assistance" in a non-asymptotic set- 
ting: given a mixed state ip AB = Tt c (\^ ABC ) (^ ABC \) 
shared between A and B, there exists a unique purifica- 
tion ip ABC up to local unitary operations on C, and the 
question was asked how much EPR-type entanglement 
can be created between A and B when C is doing local 
measurements and communicates the results to A and 
B. In this paper we completely solve that question in 
the asymptotic setting. 

About scenario 2 very little has appeared in the liter- 
ature, except for upper capacity bounds, e.g. pij . and 
a few (qubit) protocols which however remain largely in 
the single-copy setting Q, Q ■ 

The third scenario has been studied in the context 
of spin chains under the name of localisable entangle- 
ment 33]. The present work will reveal some intriguing 
connections between the concept of entropy of a block of 
spins and the entanglement length in spin systems. 

The main results are as follows: 

Theorem 1 Given a pure tripartite state ip ABC , then 
the optimal EPR rate distillable between A and B with 
the help of C under LOCC is 



measured result, followed by local operations on A n and 
B n , which effects the transformation 



oalj,ABC\ 



min{ S(A), S(B)}. 



( Our notation is such that the first two parties obtain 
EPR states, and the remaining is the helper. ) This is the 
asymptotic entanglement of assistance 

Writing the tripartite state as \ij;) ABC = 
\/<h\' l l J i) AB \i) C i with orthogonal \ j) — corresponding 
to to a pure state decomposition ip AB = J^j Qjl^j^jl^ 

let E — J2j QjEfyj) be the average entan- 
glement of the pure state decomposition. Define 
X = min{S(A),S(B)}-E. 

Theorem 2 Let ip ABC be a pure tripartite state. Then, 
for e, S > and sufficiently large n, there exists a protocol 
involving only an instrument on C n and broadcast of the 



ABC\® n 



with fidelity 1 



frABC\ 



5) 



AB\ 



im{E-S) 



Observe that the minimal value of E above is the en- 
tanglement of formation Ep(ijj AB ) of the mixed state 
ip AB between Alice and Bob 4]. In the limit of many 
copies we have to substitute the entanglement cost 
Eci^ 3 ) This outcome is better than theorem^ as 
we can always (irreversibly) turn GHZ states into EPR 
states, and achieve the previous EPR-rate. Theorem |5| 
in section shows that the corresponding GHZ-rate, 
min{S{A),S(B)} - E c (ip AB ), is indeed optimal under 
an important class of protocols. 

Theorem 3 For an m-party state ip ABC ^— c m-2 ^ ^ e p_ 
timal rate R of EPR states distillable between A and B 
with the help of the Ci via LOCC, satisfies 



R < min 

SC{Ci,...,C„ 



5 (AS). 



(1) 



This bound is an equality for all m, which is therefore the 
expression for the asymptotic version of the localisable 
entanglement. We prove this here for m = 4; the general 
proof is given in \2(\ l. Furthermore, the bound is achieved 
by a protocol where each helper Ci takes a single turn in 
which he measures his state and communicate the result 
to the remaining parties. 

Observe that the right hand side in eq. Q is the min- 
imum pure state entanglement over all bipartite cuts of 
the systems which separate Alice and Bob. 

The remainder of the paper is structured as follows: in 
section IIIII we present a protocol and prove theorem ^ 
Section llVl presents an application of this first result to 
quantum transmission with a classical helper in the chan- 
nel environment. In section [V] we show how to make 
the basic protocol coherent, such that it also gives GHZ- 
states, and prove theorem |2] Its GHZ-rate we prove to 
be optimal under a subclass of protocols which we call 
one-way broadcast. Then, in section IVTl we generalise the 
basic protocol to more than one helper (proof of theo- 
rem |21 for m = 4) , and discuss the connection between 
the concept of localisable entanglement and entropy of 
blocks of spins. Finally, in section IVIII we discuss pos- 
sible applications and/or extensions of our main result 
to asymptotic normal forms of quantum channels, and 
conclude in section IVIIII 



III. ASYMPTOTIC ENTANGLEMENT 
OF ASSISTANCE 

In 0, 0| , the following quantity was introduced under 
the name of entanglement of assistance of a bipartite 
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mixed state p AB (with purification ip ABC ): 



E A (p AB ) :=E A (iP ABC ) 



max < y y pjE(ijj 



AB 



)--p ab = Y,p^ 



AB 



The idea is that by varying a measurement (POVM) on 
C, the helper Charlie can effect any pure state ensem- 
ble decomposition p AB = ^2iPiip AB for Alice and Bob's 
state |2flj . In this sense, Ea gives the maximum amount 
of entanglement obtainable between Alice and Bob with 
the (remote) help from Charlie. Of course, we are pri- 
marily interested in the operational asymptotic rate of 
EPR states, E A , which will turn out to be given by the 
regularisation of Ea- 



lim -E A (p 

n — >oo TL 



Now we argue the upper bound, E A c (p AB ) < S(A), 
which was noted in [T^. operationally: whatever can be 
done under three-party LOCC, is contained in protocols 
which allow general transformations on BC and LOCC 
with respect to the cut A vs. BC. But in this latter 
formulation, we are in a bipartite pure state situation, for 
which the maximum yield of EPR states is well-known 
to be S(A) Q| . By an identical argument, we have the 
same bound with S(B) 7 and hence we obtain 

E%(p AB )<mm{S(A),S(B)}. (2) 

Note that this upper bound is not additive under gen- 
eral tensor products (compare 01) • consider strictly 
mixed states p AB = \0)(0\ A <8> p B and a A ' B ' = a A ' <g> 
|0)(0| s ; they have both E A = 0, because the entropy up- 
per bound is 0. However, E A {p® a) > 0. A less trivial 
example of superadditivity of E A is given in |12| for two 
copies of the same state. Here is a very easy one: 

Example 4 (Superadditivity of E A ) Consider the 3- 
qutrit determinant (or Aharonov) state 

\a) ABC = -^(|012) + |120) + |201)-|210)-|102)-|021)). 
V6 

The restriction a AB is proportional to the projector onto 
the 3 x 3-antisymmetric subspace, and it is well known 
that this subspace consists entirely of "singlets", i.e., 
states \v)\v') - \v')\v), with (v\v') = 0. Hence E A (a) = 1 
(and by the way also E F (a AB ) = E c {a AB ) = 1). How- 
ever, E A (a <£> a) > 2.5, since a® a can be presented as a 
uniform mixture of states (U Al <g> U A2 <g> U Bl <g> U B2 )\tp), 
with 

^a iA2Bi b 2 ^ _^((|oi)-|10»^ 1 - Bl ® (|01)-|10)) A2B2 
v8 



(|12)-|21))^®(|12)-|21)) 



A 2 B 2 



It is easily established that the Schmidt spectrum of this 
state is [|, |, |, |, |, |] , so its entropy of entanglement 
is E((p) = 2.5. 



This example contains a valuable insight: for a given 
single-copy decomposition of p, one can form superpo- 
sitions of tensor products of component states, and in- 
crease the entanglement. A little consideration reveals 
that this is so because the tensor products have some 
local distinguishability. Hence, in the general case we 
should try to enforce local distinguishability of the states 
we put in superposition. 



Proof of theorem^ Write \%P) ABC = £\ ^/qj\ipj] 
with an orthogonal basis of C . Let 



AB 



XA 
XB 



(3) 
(4) 



denote the Holevo information of the given ensembles; 
observe the common term 

5>S(V>/)=^ = 5>S(Vf). 



We may assume without loss of generality that S(A) < 
S(B), hence xa<Xb- 

For n copies of ip, the sequences J — ji . . .j n , and 
consequently the states \J) = \ji) ■ ■ ■ \ j n ), fall into (poly- 
nomially many) type classes: we say that J is of type 
P (which is a probability distribution on the letters j) 
if j occurs exactly nP(j) times in J. This is relevant 
because the probability qj — qj 1 ■ ■ ■ qj n , the product of 
the letter probabilities, of a sequence is constant across 
a type class. We can write the state as 



(IV-) 



ABC\® n 



E 

j 



where \ipj) = | Vj'i ) ' ' 1 an< i ^™ = ^1^2 ■ ■ ■ A n are 

Alice's n copies of system A, etc. The goal of Charlie's 
strategy will be to project this state down to a super- 
position of terms \tpj} A B which are as orthogonal as 
possible on both Alice's and Bob's systems: because then 
Alice's (say) reduced state is roughly an orthogonal mix- 
ture of the states ip A , and we can easily calculate its 
entropy. 

More precisely, Charlie's measurement consists of two 
steps: first, a projection into the subspaces of constant 
type, say P: 



n(P) := span{ J : J is of type P}. 



Note that, 

\\p-q\\ 



< 



for any i] > 0, with probability 1 — e, 
r/, if only n is sufficiently large (otherwise, 



abort). Here, || • |ji is the total variational distance (or 1- 
norm distance) of probability distributions. By Fannes' 
inequality (stated below as lemma [SJ, then 



s - E p (j) s &f) * xa 



6. 
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Second, for each such type P, letting N = [2"(x^- 25 )J , 
define states depending on a set J = {J^°\ . . . , J^ N ~ 1 ^} 
of sequences of type P and a number a = 0, . . . , N — 1: 



N-l 
v /3=0 



«/3 /N 



\JW). 



Clearly, with an appropriate constant c > 0, the collec- 
tion (§\tj{a))(tj(a)\) Ja forms a POVM on the type 

P subspace, i.e., these operators sum up to H(P). This 
is the second (rank-l!) POVM of Charlie. 

By the Holevo-Schumacher- Westmoreland theorem, 
stated as lemma below, the majority of the sets J are 



good codes for the classical-quantum channel j 



and simultaneously for j i — ► ip? For such a J, and all 
a, consider the projected state fi?) of AB (up to normal- 
isation), dropping the superscript n from the registers: 



1 



JV-1 



Because Alice and Bob have good decoders for /3, i.e., 
POVMs (Dfi)p and they can locally extract (3 

with high reliability. We can always think of these mea- 
surements as (local) isometries, for example for Alice 




and a similar expression Vb for Bob. This pair of uni- 
tarics takes the state \-ff) to 



{V A ®V B )\&) 




AB 



B'uaC" 



Since we have (with high probability) a good code both 
for Alice's and Bob's channels, we expect that 



1 



JV— 1 



-2niaf3/N 



\ A n B" 



\A' I 



\ B' 



which indeed can be shown (see the proof of theorem [21 
below). Here we need something only slightly weaker: 

When tracing over BB', we can assume that Bob's 
POVM is actually performed (i.e., the register B' ob- 
served); using the fact that both Alice's and Bob's 
POVMs have average error probability < e, we get 



< 2 ■ 2e, 



with the trace norm || • ||i on (density) operators. Fur- 
thermore, by the gentle measurement lemma {7\ stated 
below for convenience, this yields 







A' 



< 4e + V8e < 7y/e. 



A' 



— nS 



Hence, for the entropy (choosing e and r\ small enough), 

s(§ A ) = S(i) A ) 

= logN + nJ2 P{j)E{i}jf B ) - nS 

3 

> n(S(A)-AS), 

where we have used the Fannes inequality, the fact that 
all j(P> have the same type P, and Fannes inequality 
once more. □ 

Lemma 5 (Fannes inequality jl3l |) For any two 

states p and a on a d- dimensional Hilbert space: if 
\\p- o-\\i < e, then \S(p) - S(a)\ < 17(e) + Kelogd, with 
r}(x) — — xlogx and a universal constant K. □ 

Lemma 6 (HSW theorem [18]) For a classical- 
quantum channel W : x 1 — ► W x on the Hilbert space Ti, 
and a probability distribution P, let U^' be i.i.d. uni- 
formly random from the sequences of length n of type 
P. Then for every e, S > and sufficiently large n, if 
logN <n{x{(P(x),W x )}-5), 



Pr 



{C = (?7 (l) )ili is e-goodX 



> 1 - e. 



Here we call a collection of codewords e-good if there 
exists a POVM (A)ili on H® 71 such that 

1 N 

_£Tr(W- w A)>l-e. 



(In this form, the theorem is proved in \lt\L ) 



□ 



Lemma 7 (Gentle measurements [35]) Let p be a 

state (actually, p > and Tr p < 1 are enough) and 

< X < 1, such that Tr(pA) > 1 - e. 

Then, \\p- y/Xpy/X^ < \/8e. □ 



IV. CHANNEL CAPACITY WITH CLASSICAL 
HELPER IN THE ENVIRONMENT 

Gregoratti and Werner [l4| have considered the follow- 
ing channel model with helper in the environment: 



U: H A 



Hb®H c , 



described by an isometry from Alice's input system A to 
the combination of Bob's output system B and the en- 
vironment C. Assume that the environment system may 
be measured and the classical results of the observation 



5 



be forwarded to Bob — attempting to help him in error 
correcting quantum information sent from Alice. 

We are interested in the quantum capacity of this sce- 
nario from Alice to Bob, in the asymptotic limit of block 
coded information (and collectively measured environ- 
ment). The setup is illustrated in figure Q 



\ip) = (1 ® U)\<j)) A A , according to theorem ^ Then we 
can define a new quantum channel 

T' : A n — ► B n B' 

<p ^^^[(M, ® t)(U<pU*)] <g> \x)(x\ B \ 




the 



FIG. 1: Alice prepares an input to (many copies of) the isom- 
etry U, which gives part of the state to Bob and part to 
Charlie. The latter measures a POVM M on his system and 
classically communicates his result x to Bob, who executes a 
unitary V x depending on Charlie's message to recover Alice's 
sent state. 



which, by theorem ^ has on the test state \<p 
coherent information [30| 

I{A')B n B') = S(B n B') - S(A'B n B') 

>n(mm{S(p),S(T(p))}-5 



Invoking the quantum channel coding theorem |8 u25ll31| . 
there are block codes for T" achieving this rate asymp- 
totically. □ 



GHZ DISTILLATION 



We want to mention a related model, discussed by Hay- 
den and King |l7|. where the objective is to transmit 
classical information rather than quantum. Of course, 
the corresponding capacity will usually be higher, since 
the helper in the environment can learn part of the mes- 
sage and forward this information to Bob. 

Theorem 8 The environment-assisted capacity of a 
noisy quantum channel T : A > B is 

C A (T) = max p min{ S(p),S (T(p)) } . 

The same capacity is obtained allowing unlimited LOCC 
between Alice, Bob and Charlie. 

Proof. Let us first deal with the converse: whatever the 
detailed strategy, Alice will eventually input the A ra -part 
of some state \$>) A A into the channel (there is no loss of 
generality in assuming that the players keep all ancillas 
around, and hence the state pure). After the channel, 
the three players share the state 



IV-) 



A'B n C n 



= (iig)E/® n )|$; 



A' A 71 



By the same argument as for the upper bound in the- 
orem ^ the pure state entanglement between Alice and 
Bob cannot exceed either of 



S(V> A ") < E S ($ Ak ) = E s (® Ak ) < nS(p A ) and 

k 



k 



Now we will show how to modify the protocol of theo- 
rem^by "making it coherent" (after the model of [sLUl^) 
such that part of its yield is in the form of GHZ states. 
We shall freely use the notation introduced in the proof 
of theorem ^ 

Proof of theorem By possibly embedding C into a 
larger space, we can write \ip) ABC = J2j \/Qj\^Pj) AB \j) C '> 
for any pure state decomposition of ip AB into an ensemble 

{</.,<.•/'''!• 

Consider sets J = { . . . , J^" 1 )} of N = 
I 2"(xa-25)J type P sequences of length n, with, as before, 
\\P — q\\i < V- Now construct the projectors 



N-l 

->j = E IV(«)X*^(a)l. 



a=0 



so that we have a POVM (c9j) a coarse-graining of 
the measurement used in the proof of theorem ^ 

Charlie's measurement is again in two parts: first he 
measures the type subspace LT(P), and P is close to q 
as above with high probability (otherwise abort). Then 
he measures (cQj)', if the operator <dj acts, the pro- 
jected state is (dropping the superscript n from the reg- 
ister names) 



N-l 



10 



ABC 



\AB 



|J(«) C 



/3=0 



S(^ B ) < E S (^ Bk ) = 2^ S ( T (® fe )) - nS ( T (P ))i Most of the sets J are good codes for both the channels 

k 



k 



with p A = ±Y, k ® Ah - 

For the direct part, let p be the optimal input state for 
the maximum in the theorem and denote a purification 
of it \<f)) A A , which is used in the following as "test state" . 

Let Charlie pick, for some n, an optimal measurement 
(M x ) x for the entanglement of assistance of (n copies of) 



j i — ► ipf, i>f- Hence, with large probability, we can use 
the same local isometries Va and Vb as before to extract 
(3 with little state disturbance, and a local unitary Vc 
mapping \J^') *— > \0). These isometries map \Q to 



10 



AA'BB'C' 



1 



JV-1 
v 0=0 



\A'\a\B'\ 
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and because the are all of the same type, they are 
permutations of each other, so the states \ij)jw)) AB can 
be taken to a standard state \4>~j) AB — sa Y> the lexico- 
graphically first sequence of type P — by (controlled) 
permutations of the n subsystems. So, they arrive at the 
state 



of eq. J2J|. The smaller of these numbers clearly is just 



\Z) 



AA'BB'C' 



AB 



1 N ~ l 



\B'\o\C' 



This concludes the proof, since the rate of N is asymp- 
totically xa, and the rate of -E(-04 B ) is asymptotically 



E. 



□ 



Remark We have presented the POVMs of theorem Q] 
and |2 in the simplest possible terms. One can also min- 
imise these POVMs, by not taking all sets J. This can be 
done as shown in [3, Qlj, yielding for theorem ^ a rank-1 
measurement with ps 2 nS ^ elements; for theorem [21 the 
POVM has ps 2 n ^~ x ^ operators. □ 
Now we show that theorem El is in a certain sense op- 
timal: namely, it gives the largest GHZ rate among all 
protocols which consist only of (i) a local operation with 
measurement at C, (ii) sending the classical information 
obtained in the measurement to A and B, and (iii) local 
operations of A and of B depending on the message. In 
particular, we allow no feedback communication and no 
communication between Alice and Bob. These are severe 
restrictions, but at least the protocol from theorem is 
of this type: we call it one-way broadcast. 

Theorem 9 Under one-way broadcast protocols from C 
to AB, the asymptotic GHZ rate from the state tp ABC 
cannot exceed mm{S(A), S(B)} — Ec{^p AB ) ■ 

Proof. We show actually a bit more: the rate of three- 
way common randomness distillable by such protocols 
is asymptotically bounded by the same number. This 
problem was studied in 9] for two players with one-way 
communication, and the relevant observation here is that 
with one-way broadcast, the task is equivalent to two si- 
multaneous two-player common randomness distillations: 
from C to A and from C to B. 

What it is about is the following: the sender C and 
the receiver (A or B) initially share a quantum state, 
and by local operations and one-way classical commu- 
nication want to distill a maximum amount of shared 
randomness, which however has to be independent of the 
communicated message(s). 

A particular protocol for doing this is to distill GHZ 
states by a one-way broadcast protocol and then all three 
measure these states in the computational basis — by pu- 
rity of the measured state, the resulting perfect shared 
randomness is independent of everything else in the pro- 
tocol. 

It was shown in [jj that the maximum rate achievable 
between C and A is the maximum of eq. (|3J) — actu- 
ally regularised for many copies of the state; and simi- 
larly between C and B the — regularised — maximum 



x{S(A),S(B)}-E c (ip 



AB\ 



□ 



Remark In general, the GHZ rate obtainable from a 
pure state \ip} ABC by general LOCC has the easy upper 
bound min {S{A),S(B),S(C)}. Remarkably, our proto- 
col of theorem[21 achieves this in for broad class of states, 
namely when one of the reduced states ip AB , ip BC or ip AC 
is separable. □ 

Example 10 (Groisman, Linden, Popescu [15]) 

Consider the family of states 

\T a ) ABC = a\0) A \$+) BC + 

with < a < (3 and a 2 + 1 = 1, interpolating between 
a state |$ 2 ) BC {a 2 = 0) and |r) ABC (a 2 = 1/2; up to 
local unitaries). Observe that it is certainly possible to 
obtain 1 EPR state between B and C from T Q . 

In [15j it is observed that the local entropies of T a are 
consistent with the hypothetical existence of an asymp- 
totically reversible transformation into 



H 2 (a 2 ) x \T) ABC and [l-H 2 (a 2 )] 



x 1$. 



(5) 



per copy of the state, but the authors present heuristic 
arguments for its impossibility. 

Let us see what our results tell us about the distilla- 
bility of GHZ and EPR states: by applying theorem [3 
with B (or equivalently C) in the role of the helper, we 
obtain (since T AC is separable) a GHZ rate of #2 (a 2 ), 
but no EPR states. The GHZ rate is evidently optimal 
under general LOCC protocols, as it coincides with Al- 
ice's entropy (i.e., her entanglement with the rest of the 
players). By applying theorem with A as the helper, 
we have to calculate the entanglement cost of the Bell 
mixture T BC , which happens to be known by [34| and 
IH: 



E c (r BC ) = E F (r BC ) = n 2 {- - ,,i 

Hence we get distillation of 
1 



1-Ho 



a/3 



x\T) ABC and H 2 ( | - a(i ) x |$ 2 ) ( 



per copy of the state, and theorem|51shows that this GHZ 
rate is optimal among all one-way broadcast protocols 
from A to BC. Note that the GHZ rate is slightly worse 
than the one stated in eq. JSJ, of which it is unknown if 
it can be achieved. 

Example 11 (VK-State) Another interesting example 
is provided by the W-state Q 

|W) = -^(|001) + |010) + |100)), 

which is interesting because it cannot be converted to a 
GHZ state even probabilistically (on a single copy). 
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Theorem ^ tells us that any two parties can obtain a 
rate of H 2 (^) ~ 0.918 EPR states, with assistance from 
the third. Since two EPR pairs between different players 
can be converted into a GHZ state, we can obtain a GHZ 
rate of at least \-H 2 (\) » 0.459. 

However, using theorem[21 we can do a bit better: the 
entanglement of formation of any two-party reduced state 
is evaluated with the help of |36j, and we get rates of 
H 2 (\) - H 2 ((l - y / 5/9)/2) w 0.368 for GHZ states, and 
H 2 ((l - y/t/9)/2) w 0.550 for EPR states between any 
pair of players. Converting the EPR states to GHZ's 
as before, we arrive at an overall rate of GHZ states of 
H 2 {\) - \H 2 {{\ - y/5/9)/2), which is « 0.643. 



VI. SINGLET DISTILLATION WITH THE 
HELP OF MANY (DISTANT) FRIENDS; 
ASYMPTOTIC LOCALISABLE 
ENTANGLEMENT 

Consider now the m-party generalisation of the task 1: 
distillation of EPR pairs between A and B with the help 
of Ci, . . . C m _2 from an m-partite pure state, by LOCC. 

In analogy to the upper bound eq. J2j, we can easily ob- 
tain an upper bound on the achievable rate R in this sce- 
nario: surely, the distillable entanglement can only go up 
if we allow Alice to team up with a subset S of the helpers 
Ci, and Bob with the complement S = {1, . . . , m — 2}\<S, 
such that all collective operations on AS and on BS are 
allowed, and LOCC between these two groups. Thus, 
R < S(AS), and we get eq. (HJ, 



R < minS(AS) 



min S(BS). 



[Note that this reduces to the inequality J3J) for m = 3.] 
In it was shown that whenever the right hand side 

in the above equation is non-zero, then one Alice and 

Bob can, with LOCC help from the other parties, distill 

EPR pairs at non-zero rate. 

It turns out, however, that the right hand side is 

achievable for any m, and we show here how to do it 

in the case of m = 4 (the general case requires different 

arguments and is solved in ppf). 

Proof of theorem^ for m = 4. Only the achievability of 
the minimum cut entanglement is left to be proved. For 
to = 4, i.e. two helpers Charlie and Debbie, this means 
we are looking at R = mxa{S(A), S(B), S(AC), S(BC)}. 

Our goal will be to construct a measurement on 
D n such that for the (majority of) projected states 



with arbitrary 5 > and sufficiently large n. I.e., we 
want to preserve (up to a small loss) the minimum cut 
entanglement, while disengaging Debbie. If we succeed 
doing this, we can invoke theorem ^ for the residual tri- 
partite state. 



Pick any basis of D, so that we can write \ip) ABCD — 
y/'lj\i J j) ABC \j) D ■ As in the previous proofs, we have 
reduced state ensembles with Holevo informations xa, 
Xb, Xac and xbc- By possibly swapping A and B, we 
may assume that xa < Xb ■ Invoking monotonicity of the 
Holevo information under partial trace, xa < Xac and 
Xb < Xbc, we are left with one of the following ordcrings 
of the four quantities: 

XA < XB < XAC < XBC: 
or XA < XAC < XB < XBC- 

Define xo : = nfinjxB, Xac} and consider random 
codes J of rate xo — 5 where, in slight variation to the 
proof of theorem ^ the codewords are drawn from the 
distribution q® n — this is the original form of the HSW 
theorem and the conclusion of lemma holds true. 
Now construct a rank-1 measurement on D, in the same 
way as we did there. What can we say about the pro- 
jected state 



\ABC 



-2iriaf3/N 



ABC 7 



For the bipartition B\AC, essentially the same argu- 
ment from that proof shows that with high probability, 
the entanglement of 7? is 

E(^ AC ) > n(min{S(B),S(AC)} - S) > n(R - 6). 

For the bipartition A\BC this only works when xa — 
XOj to make the rate of the codes smaller than either 
Holevo information. So, let us assume xa < XO; and 5 so 
small that xa + 8 < Xo — The rate of the code is still 
smaller than xbc, so there exists a (hypothetical) "local" 
decoding of j3 from the register BC. I.e., with respect to 
the bipartite cut A\BC, the state \&) is equivalent to 



\ABC 



1 



-2-niaf3/N 



"7 



\ABC\f3\BC 



where the approximation has the same quality as in the 
proof of theorem ^ But then, we have 



Now we can conclude the proof by invoking lemma 1121 
below, which states that for a random code (which is 
what the POVM will select) the average on the right hand 
side is ~ (ij) A ^ n . Hence, and using Fannes' inequality 
once more, 



E{d A \ BC ) > n(S(A) -S)> n(R - S), 
and we are done. □ 

Lemma 12 (Density sampling [2]) Consider the en- 
semble \(qj,Pj)} of states on a d- dimensional Hilbert 
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space, with average density operator p and Holevo in- 
formation X- Let independent and identically distributed 
random variables X±,...,Xn, drawn from the states 
P-J = Pji ® • • • ® Pj n with probability qj = qj 1 ■ ■ ■ qj n . 
Then, for every e, 8 > 0, N > 2 n ( x+s \ and sufficiently 
large n, 



N 



1l a 

fc=l 



< e 



with probability > 1 



□ 



Theorem |3 yields an exact expression for the asymp- 
totic localisable entanglement [33 (except for the techni- 
cal issue that there one has an infinite number of parties, 
whereas here we considered only finite m). The concept 
of localisable entanglement was introduced in the context 
of quantum spin systems, and allows to define a notion 
of entanglement length when these spins are part of a 
lattice with a given geometry. More precisely, consider 
the maximal bipartite entanglement that can be localised 
between two blocks of spins as a function of the distance 
between the blocks; typically this function is decaying 
exponentially with the distance, exp(— L/£), and the en- 
tanglement length is defined as the constant £ in this 
exponent. Theorem gives the exact expression for the 
localisable entanglement between two blocks if asymptot- 
ically many realisations of these systems are available and 
joint local operations can be performed. If furthermore 
we are considering a state with infinitely many particles 
and translational symmetry (which is the usual case in 
condensed matter systems), then the strong subadditiv- 
ity property of the von Neumann entropy enforces the 
entropy of a block of spins to grow when more spins 
are included in the block. It follows that the asymp- 
totic localisable entanglement in such systems between 
two blocks is exactly given by the minimal entropy of 
these blocks, which proves that the upper bound given 
in is actually the exact value for the localisable entan- 
glement in the asymptotic limit. This is very surprising: 
the magic of doing local asymptotic operations allows to 
create entanglement between two blocks that are arbi- 
trary far from one another, and the rate at which this 
can be done is independent of the distance. This implies 
that any nontrivial ground state can be used as a perfect 
quantum repeater if many copies are available in parallel. 
The amount of entanglement that can be localised over 
these arbitrary distance is solely related to the entropy of 
a block of spins and not dependent on the distance. It is 
interesting to contrast the translationally invariant case 
to the one with random bond interactions |27| : in the 
latter case, the minimal entropy over all bipartite cuts 
will decrease algebraically with the distance between the 
blocks. This indicates that the entanglement in the case 
of random systems is essentially different than in the case 
of translationally invariant ones, something that is not 
revealed by looking at the entropy of a block of spins. 

The problem of calculating the entropy of a block of 
spins has recently attracted a lot of attention in con- 



densed matter physics [l9( , where it was shown that this 
entropy, in the case of ground states of 1-dimensional 
systems, saturates to a finite value or increases logarith- 
mically as a function of the size of the block, depending 
on whether the system is critical or not. The present 
work provides an operational meaning to these calcula- 
tions in the sense of entanglement theory: this entropy 
quantifies the amount of entanglement that can be cre- 
ated at arbitrary distances if this ground state would be 
used as a quantum repeater. In higher dimensional sys- 
tems, the entropy of a block of spins grows as the bound- 
ary of that block, and therefore there is no bound on the 
amount of EPR-pairs that could be localised between two 
far away regions by doing joint local measurements on all 
the other spins; this is again the consequence of the fact 
that the asymptotic operations allow for perfect entan- 
glement swapping in multipartite states. 



VII. ASYMPTOTIC NORMAL FORMS OF 
UNITAL & GENERAL QUANTUM CHANNELS 

Based on the well-known linear isomorphism between 
completely positive and trace preserving maps and a set 
of quantum states |2l| : 



T : A—f B 



A'B 



p T = (id <g> T)<j> 



A' A 



with a pure state \4>) A of Schmidt-rank dA = dim 7^, 
we can interpret our findings in theorem ^ as statements 
on quantum channels. Note that T is an isometry or 
unitary, if and only if the state px is pure, and the corre- 
sponding states of different isometries are equivalent to 
each other up to unitaries on the system B. We shall use 
this isomorphism in the following with a maximally en- 
tangled state 4> A of Schmidt rank dA, unless specified 
otherwise. 

Let T be a unital quantum channel on a system, i.e. 
mapping the identity on A to the identity on B (and 
assume input and output system to be of the same di- 
mension d = dA = g?b for the moment). The corre- 
sponding state has the properties p B = Tr a' Pt = 3I, 
p A = Tr^ pt = 3I. Thanks to theorem ^ we know that 
in the asymptotic scenario the entanglement of assistance 
of pT is given by log d. 

Clearly, the unital channels (and equally, the states 
with maximally mixed marginals as above), for a con- 
vex set, and the question of determinin g it s extremal 
points has attracted quite some attention J22j ■ The clas- 
sical analogue of this problem is about doubly stochastic 
maps which, thanks to Birkhoff 's theorem, are known to 
be exactly the convex combinations of permutations. For 
quantum doubly stochastic maps (another popular name 
for unital trace-preserving channels) the "obvious" gen- 
eralisation is wrong: there exist unital channels which 
are not convex combinations of unitaries. Under the 
Jamiolkowski isomorphism, this means that the state 
Pt is not a convex combination of maximally entangled 
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states; a specimen of this type we have actually studied 
in example 01 

However, theorem ^ points a way to resolving this 
unsatisfactory state of affairs in the asymptotic limit: 
since the asymptotic entanglement of assistance of pT 
is logci, we can say that p® n is well approximated by 
a convex combination of "almost" maximally entangled 
states in the sense that their entropies of entanglement 
are n (logci — S) for arbitrarily small 5 > and suffi- 
ciently large n. We would like to deduce from this that 
T®" is well approximated by a convex combination of 
unitaries (in the appropriate norm), but unfortunately 
the latter is really a stronger statement since it would 
give an approximation of p® n by a convex combination 
of states that have high fidelity to some maximally entan- 
gled state. And that is not even mentioning the issues of 
the different norms to be used for comparing states and 
for channels. 

Similarly, for a general channel, and general <f> A A , the 
state pj, n can be restricted to the typical subspaces |2^| 

of (px')^ U on Alice's side, and of (p B )® n on Bob's 
side, without changing the state very much. This pro- 
jected state lives in a Da x D^-dimensional system, with 
D A S3 2 nS( - A ^ and D B S3 2 nS{ - B \ By theorem [T] it is well 
approximated by a convex combination of pure states 
with entanglement n(mh\{S{A), S(B)} — 5) , which again 
is too weak to say that the components have high fidelity 
with maximally entangled states. 

Nevertheless, we take these observations as positive ev- 
idence for the following conjecture: 

Conjecture 13 Let T be a unital quantum channel on 
a system, or more generally a map T : A — > B such 
that for all input states p A , S(p) < S , (T(p)). Then, for 
sufficiently large n, T® n is arbitrarily well approximated 
by mixtures of isometries (unitaries in the unital case). 

In general, T® n is arbitrarily well approximated by 
mixtures of partial isometries between A n and B n (i.e., 
unitary transformations between subspaces of systems A n 
and B n ). 

The appropriate distance measure for quantum chan- 
nels T and T' to be used here is 

llT-T'Hcb = max ||(id®T)cA- (id®T>||i, 

the completely bounded norm (cb-norm) \2d_ l. 

In further support of this conjecture, we now outline 
a proof for a weaker version of it, where the comparison 
of T®" and the mixture T" of unitaries is done not in 
the worst case over all input states, but with respect to 
a single state cA®": we want \\p® n - (id <g> T')0® n |[ 1 < e. 
The significance of such a statement is that if cf> is a pu- 
rification of a mixed state a on A, and {pk,4>k} is any 
source ensemble on A n with average a® n , then the av- 
erage error, Y^kPk\\T® n (4>k) — T' ((f)k)\\ v is also bounded 
by e. 

For simplicity, we assume the S{o~) is strictly smaller 
than S(T(a)). Note that one could always modify the 



channel trivially by padding the output with a sufficiently 
maximally mixed state, to enforce this condition. 

We can write down a purification of pt in Schmidt 
form, 

W A ' BC = Y,V^m A ' B \j) C \ 

j 

with orthogonal states and By assump- 

tion, 

xa> ■■=s{a)-Y,q ] s{^ A ') 

3 

<S(T(a))-J2ljS(^) =: X b, 
j 

so we can choose a number R between these two values. 
Now we go through the random coding argument in the 
proof of theorem^ but actually in the form of the second 
case considered in the proof of theorem |3] Since here we 
assume that we have uniform distribution on the j, there 
is no need to restrict to the set of typical sequences. 

What we get are random codes of TV = 2 nR sequences 
J = ji . . . jn , such that the corresponding states ijjjiP) 
(dropping superscript n as before) form a good code for 
Bob. That means, that for the superpositions 

(resulting from projecting the system C onto a vector 
\t$)), we obtain, as at the end of the proof of theorem|31 
that fl A S3 jj J^p i>j{f3) ■ And exactly as there, we can 
use lemma IT21 to conclude that $ A S3 a® n with respect 
to trace distance. Both approximations in fact with high 
probability over the choice of the code. That means that 
there is a purification |C#) A B of a® n such that -d A B S3 

/■A'B 

The connection to channels is now made by going the 
Jamiolkowski isomorphism in the other direction: for the 
well-behaved & as above, there exists an isometry V$ : 
A —> B such that |C#) = (1 ® V^)|ci>®"), and hence our 
candidate mixture of unitaries is 

T'(<p)= v*V*<pVl 

$ well-behaved 

where the w$ are probability weights. They are obtained 
as essentially \ c (t$\ip® n ) A BC \ , normalised to the prob- 
ability of the well-behaved set. It is then straightforward 
to verify that indeed ||pf " - (id ® r')^® 71 ^ is small. 

We want to close this section with a few comments 
on the difficulties encountered in the attempt to extend 
this argument to a proof of our conjecture. Clearly, the 
vectors \t&) determine Kraus operators D$ for T® n via 

C (U\ip® n ) A ' BC = (1(8)^)10®"). 
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Because the cb-norm difference of T® n and T" can be up- 
per bounded by II D# — V$\\ (with the operator norm 
|| • ||), it is tempting to aim at making the latter quantity 
small. But with our fixed source a, we can say something 
about the difference D$ — Vg only on the typical subspace 
of the source, and even there only on the average — it 
is conceivable, and consistent with our result, that the 
operator norms of the — V§, when restricted to the 
typical subspace, are all large. In addition, in the above 
proof we can make our statements about •& only "with 
high probability" and the probability distribution is also 
determined by the source a. 

VIII. DISCUSSION 

We have presented a class of very general procedures to 
distill singlets and cat states, both in tripartite and mul- 
tipartite settings. These procedures give universally the 
largest EPR rate distillable between any pair of parties 
in a multipartite state, when the other players cooperate. 
For three parties, this problem and its solution is equiv- 
alent to the previously considered entanglement of assis- 
tance. We have shown how GHZ (and higher cat state) 
distillation protocols can be constructed from common 
randomness distillation schemes by "coherification" . It 
should be clear that a good number of variations of what 
we have shown here can be done. As a consequence, we 
could solve the problem of quantum channel coding with 
maximal classical help from the environment. 

We stress that even though we look here at pure state 
transformations, we did not attempt "entanglement con- 
centration" , which is meant to generalise the asymptotic 
theory of bipartite pure states: there we have asymp- 
totic reversibility, and demanding this leads to the hard 
MREGS problems. Instead, we do "distillation" of spe- 
cific states (which surely will be interesting), embracing 
the possibility of irreversibility, but going for the maxi- 
mum rate. We think that understanding these problems 
will remain central even assuming availability of a com- 
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